(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
Int^ational Bureau 

(43) International Publication Date 
10 January 2002 (10.01.2002) 




PCX 



liiiiiiiiiiiiiiiiiiiiiiiiin 

(10) International Publication Number 

wo 02/03601 Al 



(51) International Patent Classification^: H04L 1/20, 

GIOL 19/00 // H04B 7/26, H04Q 7/38 

(21) International Application Number: PCT/ROl/00560 

(22) International Filing Date: 13 June 2001 (13.06.2001) 

(25) Filing Language: English 

(26) Publication Language: Bnglish 



(30) Priority Data: 
20001577 



30 June 2000 (30.06.2000) ¥1 



(71) Applicant (for all designated States except US): NOKIA 
CORPORATION [FI/FI]; Keilalahdentie 4, FIN-02150 
Espoo (FI). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): MIKKOLA, 
Hannu [FT/FI]; Ippisenkatu 15, F[N-33300 Tampere (FI). 
VAINIO, Janne [FI/FI]; Simunantie 8-12 B 6, FIN-33880 
Lempaaia (FI). ROTOLA-PUKKILA, Jani [FIM]; 
LehvSnkatu 24 E 44, FIN-33820 Tampere (FI). 



(74) Agents: WALKER, Andrew et al.; c/o Nokia. Coipora- 
tion, P.O. Box 206, FrN-00045 Nokia Group (FI). 

(81) Designated States (national): AE, AG, AL, AM, AT, AT 
(utility model), AU, AZ, BA, BB, BG, BR, BY, BZ, CA, 
CH, CN, CR. CU, CZ, CZ (utiUty model), DE, DE (utility 
model). DK, DK (utility model), DM, DZ. EE, EE (utility 
model), ES, FI, FI (utility model), GB. GD, GE, GH, GM, 
HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, 
lr, ls, lt, lu, lv, ma, MD, MG, MK, MN, MW, MX, 
MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI. SK, SK 
(utility model), SL, TJ. TM, TR, TT, TZ, UA, UG, US. UZ, 
VN. YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC. NL, PT, SE. TR). OAPI patent (BF, BJ, CF, 
CG, a, CM, GA. GN. GW. ML, MR, NE, SN, TD, TG), 

Published: 

— with international search report 

[Continued on next page] 



(54) Title: SPEECH CODING 



m 

o 
O 




Mobile StaUon (MS) 




Multi- 




MulU- 


Speech 




Rate 




Rate 


Pat? to 




Speech 




Channel 






Encoder 




Encoder 







Mode Command 




Speech Deco- 
Para- ded 
meters Speed* 



Multi- 
Rate 
Speech 
Decoder 



Speech 
Slonal 
Analysis 



Source 
Adaptation 



Mode Command 



(57) Abstract: A wireless telecommunications system comprises a mobile station (MS) and a network- The mobile station has. a 
multi-rate speech encoder which produces an encoded speech signal which is transmitted to the network. The network has a multi-rate 
speech decoder which decodes the encoded speech signal to produce a decoded speech signal. The network also comprises a signal 
analyser which measures speech characteristics of the decoded speech signal to produce speech characteristics information and an 
up-link mode control unit which receives the speech characteristics information and produces a mode command. The mode command 
is transmitted by the network to the mobile station where it is used to control the speech encoding bit rate of the multi-rate speech 
encoder. 
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SPEECHCODING 

The invention relates to adaptive multi-rate speech coding in wireless 
telecommunications networks. 

5 

In transmission of a speech signal between a mobile station (MS) and a network, it 
is important to optimise the coding of the signal both to provide robust 
transmission in order to deal with variability of channel quality over the air interface 
and to provide high quality speech. Transmission can be made more robust in 
10 poor channel conditions by increasing the amount of error correction provided in 
channel coding. Speech quality can also be increased by using a higher bit rate in 
speech transmission. Since the gross channel bit rate is generally fixed, a balance 
needs to be struck between the proportions of it which are assigned to channel 
coding and to speech coding. 

15 

The European Telecommunications Standards Institute (ETSI) has standardised a 
speech service which is intended to deal with this problem. This is referred to as 
the Adaptive Multi Rate (AMR) speech service and, in one form, is defined in ETSI 
standards. In the AMR speech service the bit rates assigned to speech and 
20 channel coding are adaptively varied based on the channel quality. This is done 
independently in both the up-link (mobile to network) and in the down-link (network 
to mobile) directions. The AMR speech service has also been chosen as the 
mandatoiy speech service for third generation (3G) telecommunication systems. 

25 Figure 1 sliows a wireless telecommunications system capable of providing an 
AMR speech service according to the standards referred to above. Figure 1 is split 
into two parts which are presented on separate sheetis of the drawings. Arrows A 
and B show how the two parts are connected to each other. The system 
comprises a network and a mobile station (MS). The network comprises a 

30 transcoder (TC) and a base transceiver station (BTS). A signal is transmitted from 
the MS to the network over an up-link radio channel and from the network to the 
MS over a down-link radio channel. 
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In the up-link direction, in the MS speech is speech encoded in a multi-rate speech 
encoder, channel encoded in a multi-rate channel encoder and the resulting 
encoded signal is then transmitted across the air interface. In the network, the 
encoded signal is decoded in a multi-rate channel decoder in the BTS and then 
5 speech decoded in a multi-rate speech decoder in the TC. 

In the down-link direction, speech is encoded in a multi-rate speech encoder in the 
• TC, channel encoded in a multi-rate channel encoder in the BTS and the resulting 
encoded signal is then transmitted across the air interface. In the MS, the encoded 
10 signal is channel decoded in a multi-rate channel decoder and then speech 
decoded in a multi-rate speech decoder. 

Both for up-Iink and down-link, information about the quality of the radio link is 
derived by estimating the current channel state. Based on the channel state, and 
15 also taking into consideration possible constraints from network control, such as 
network load, encoding and decoding are adaptively varied by codec mode 
control. The codec mode control, in the form of a link adaptation block containing 
up-link and down-link mode adaptors, is located on the network side. The codec 
mode control selects the codec modes to be applied. 

20 

. In this sense the term "codec" is referring to corresponding encoder/decoder pairs 
on opposite sides of the air interface. Therefore, in up-link, the codec is the multi- 
rate speech encoder and multi-rate channel encoder in the MS and the multi-rate 
channel decoder and multi-rate speech decoder In the network. In down-link, the 
25 codec is the multi-rate speech encoder and the multi-rate channel encoder in the 
network and the multi-rate channel decoder and multi-rate speech decoder In the 
in the MS. 

The channel mode which is used (TCH/AFS (adaptive full rate traffic channel for 
30 speech) or TCH/AHS (adaptive half rate traffic channel for speech) is controlled by 
the network. Up-link and down-link always have the same channel mode. 
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A quality indicator is derived at the side wliicli receives tfie transmitted encoded 
signal. Tfie quality indicator is derived by a quality measurement unit. 

For up-link adaptation, the quality indicator froni an up-link quality measurement 
5 unit is directly fed into an up-link mode control unit located in the network. This 
mode control unit quantises the quality infonnation according to certain threshold 
values and generates, taking into consideration possible constraints from network 
control, a Mode Command MCu indicating the codec mode to be used in the up- 
link path (in the up-link direction). The Mode Command is then transmitted in-band 

10 across the air interface to the MS and it is provided to the multi-rate speech 
encoder in the MS. The bit rate of speech encoding Is changed in response to this 
command. The multi-rate speech encoder provides a Mode Indicator Mlu to the 
multi-rate channel encoder in the MS. The bit rate of channel encoding is changed 
in response to this indication. The bit rate changes to the multi-rate speech 

15 encoder and to the multi-rate channel encoder are linked so that, in total, they 
equal the up-link channel gross bit rate. The Mode Indicator is then transmitted in- 
band across the air interface to the network and it is provided to the multi-rate 
channel decoder in the network. The bit rate of the channel decoder is changed in 
response to this Mode Indicator. The multi-rate channel decoder provides the 

20 Mode Indicator to the multi-rate speech decoder in the network (particularly in the 
transcoder of the network). The bit rate of the speech decoder is changed in 
response to the Mode Indicator. The bit rate changes to the multi-rate channel 
decoder and to the multi-rate speech decoder are adjusted so that they match the 
bit rates of the speech and channel encoders in the MS. Following these changes, 

25 incoming speech at the mobile is encoded using a codec mode which is suitable 
for the state of the up-link channel. 

For down-link adaptation, the quality indicator from a down-link quality 
measurement unit is used to generate a Mode Request MRa in a down-link Mode 
30 Request generator in a link adaptation block located in the MS. The down-link 
Mode Request generator generates the Mode Request by comparing the down- 
link quality indicator with certain thresholds. The Mode Request indicates the 
prefen-ed codec mode for the down-link path (in the down-link direction). The 
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Mode Request is transmitted in-band across the air interface to the networic where 
it is fed into a down-linic mode control unit This mode control unit generally grants 
the requested mode. The Mode Request may be overridden depending on 
possible constraints from network control. The Mode Request is then applied so 
5 that an incoming speech signal in the network is encoded in a way similar to that 
described above for the up-iink channel. 

Both for up-link and down-link, infonnation about the presently applied codec 
mode (mode indication) is transmitted in-band. The Mode Indicator is channel 
10 coded using a block code which is separate from speech data. In the receiving 
end, this code is first decoded in the channel decoder to determine the right 
decoding mode for speech data. At the multi-rate channel decoder, the Mode 
Indicator is decoded and applied for channel decoding and speech decoding. 

15 The thresholds which are used to compare against measured channel quality 
estimates and to decide when to change encoding modes are sent to the MS by 
the BTS. In total, there are 8 codec modes and a set of 4 codec modes are 
selected from these. Only one combination (codec set) of 4 particular codec 
modes can be used In a given direction at a given time. The codec modes differ by 

20 the proportions of the channel gross bit rate allocated between speech coding and 
channel coding (more robust transmission has lower bit-rate speech coding and 
more bits allocated to channel coding in order to deal with a larger amount of 
channel errors). Each codec set comprises a set of thresholds. The thresholds 
represent the optimum switching points between sequential speech codec modes. 

25 They are set to apply when the mode with the higher speech coding bit rate suffers 
from channel errors so much that the subjective quality drops to the level of the 
speech codec with the lower bit rate in the same channel conditions (fewer 
channel errors in the lower mode because of rhbre channel coding). Information 
enabling the codec modes to be set is sent by the BTS to the MS at the beginning 

30 of communication and from time to time thereafter. For example, the set of 4 
codec modes may be changed oh hand-over. This maybe in response to hand- 
over to a BTS that does not support the codec modes used previously. 
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The GSM AMR speech system operates on the existing GSM full-rate and half- 
rate speech traffic channels where the gross bit rates are fixed. 

The codec mode in up-link may be different from the one used in down-link, while 
5 the channel mode, whether half-rate or full-rate, must be the same. 

In the GSM AMR speech service, control is centralised in the network and the MS 
is not allowed to make autonomous decisions about the codec mode in the up-link 
direction. However, the network is able to determine the down-link codec mode. In 

10 doing this, the network may use the Mode Request or may use other information. 
The MS niust be capable of operating in all possible codec modes, that is 
encoding and decoding for all 8 codec modes has to be implemented in the MS. 
The network, on the other hand, does not have to provide all possible modes 
because the BTS sets the mode-set (a maximum of 4 modes) for the MS and 

15 there is no mandatory requirement for It to be capable of encoding and decoding in 
every mode. 

Other forms of codec adaptation exist. One form, referred to as source-controlled 
codec adaptation, is based on characteristics of the input signal. In this case, the 

20 bit rate and the parameters describing the speech signal are selected by the 
speech encoder. This may depend on whether the input signal is voiced/unvoiced 
and/or on the level of background noise. There exist well-known methods to adapt 
codec operation according to these parameters. A simple method is to measure 
the energy of the speech signal during speech pauses before it enters the speech 

25 encoder and to use the measurement obtained to select a higher or a lower bit 
rate. If the power is higher, higher bit rate encoding could be chosen and this 
would suggest a higher level of background noise. 

An example of a system that uses source-controlled adaptation is the IS-95 CDMA 
30 mobile telecommunications system. In this system, control is not centralised and 
the mobile station and the network can each change the bit rate of the speech and 
channel encoders autonomously according to the speech signal which is input at 
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the respective part of ttie system. The system does not have channel quality- 
controlled adaptation- . 

The reason for providing source-controlled codec adaptation is that coded speech 
6 quality degrades as a function of the bit rate, and the degradation is more distinct 
in the presence of the bacl<ground noise. Therefore, the optimal switcfiing point 
between different codec modes is different depending of the presence of 
bacl<ground noise. However, in systems having centralised control for adaptation, 
where a mobile station receives and must obey instructions to change its speech 
10 encoding bit rate from a mode control unit in the network, implementation of a 
mobile station controlled source-controlled codec adaptation, such as that used in 
18-95, is not possible. 

The current implementation of the GSM AMR speech service is an adaptive 
15 system controlled by channel quality measurement. There is no source-controlled 
adaptation. 

According to a first aspect of the invention there is provided a method of 
controlling adaptive multi-rate speech coding in a wireless teleconimunications 
20 system comprising a nriobile station and a network, the method comprising the 
steps of: 

measuring speech characteristics of a decoded signal produced by a multi-rate 
decoder located in the network; 

forming a mode command on the basis of the speech characteristics; 
25 transmitting the mode command from the network to the mobile station; and 

using the mode command to control the speech encoding bit rate of a multi-rate 
speech encoder located in the nriobile station. 

Preferably the method includes the step of producing a channel quality 
30 measurement. Most preferably a combination of the channel quality measurement 
and the speech characteristics is used to produce the mode command. If such a 
combination is used and it is desired to use the speech characteristics only, the 
weight of the channel quality measurement component may be set to zero. 
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Preferably the signal is a speech decoded signal. Alternatively the signal 
comprises speech parameters from a channel decoded signal. The signal may 
comprise both speech parameters and decoded speech. Preferably the quality 
5 measurement of the signal is based on one or both of speech parameters and 
decoded speech. 

Preferably the signal is output by a multi-rate channel decoder. The signal may be 
output by a multi-rate speech decoder. The signal may be output by both a multi- 
1 0 rate channel decoder and a multi-rate speech decoder. 

Preferably the mode command is formed by providing the speech characteristics 
measurement of the signal to a mode control unit. The mode control unit may then 
produce the mode command. 

15 

Advantageously, the invention may provide a source-controlled adaptation method 
in which the network instructs the mobile station to alter its speech encoding bit 
rate in changing channel conditions. This may be due to changing carrier-to- 
interference ratio (C/l) conditions or bit error rate (BER). 

20 

Preferably the mobile station is not permitted to autonomously change its codec 
mode without receiving the mode command from centralised network control. 

Preferably the mode command is used to control the channel encoding bit rate of a 
25 multi-rate channel encoder in the mobile station. Preferably a mode indication 
derived from the mode command is used to control the channel decoding bit rate 
of a multi-rate channel decoder in the network. Preferably the mode indication is 
used to control the speech decoding bit rate of the multi-rate speech decoder 
located in the network. 

30 

Preferably the method is used to control encoding and decoding in an up-link 
channel. 
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Preferably both source-controlled and channel quality-controlled adaptation of 
encoding and decoding bit rates are perfonmed. 

Preferably a channel quality measurement is made of a channel between the 
5 mobile station and the network. Both up-link and down-link channel quality may be 
measured. Up-link channel quality measurement may be made in the network. 
Down-link channel quality measurement may be made in the mobile station. 
Preferably the up-link channel quality measurement is combined with the speech 
characteristics measurement of the signal to produce the mode command. The 
10 speech characteristics measurement of the signal may be used to modify 
thresholds associated with transitions between encoding bit rates. 

Preferably the method is used to control encoding and decoding in tlie down-link 
channel. Preferably the down-link channel quality measurement is used in the 
15 mobile station to generate a mode request. Preferably the mode request is 
transmitted to the network and the network chooses whether to grant it. 

Preferably a measurement is made of speech characteristics of an input signal to 
a multi-rate speech encoder located in the network. The mode request may be 
20 modified by the measurement of speech characteristics of the input signal to 
produce a down-link mode command. This modification may occur in the network. 
The network may take other factors into account in producing the mode command. 
Preferably the mode command is used to control the speech encoding bit rate of 
the multi-rate speech encoder located in the network. 

25 

Altematively, the measurement of speech characteristics of an input signal is used 
to modify thresholds associated with transitions between encoding bit rates. 
Modified thresholds may be transmitted from the network to the mobile station 
using existing signalling methods. Layer 3 signalling may be used. This applies 
30 particularly if the invention is applied to a GSM system. Subsequent to this, a 
mode request may be transmitted to the network. 
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These embodiments enable source-controlled adaptation to be performed in 
conjunction with the channel quality-controlled adaptation using existing signalling. 

Preferably a down-link mode indication obtained from the mode command is used 
5 to control the channel encoding bit rate of a channel encoder in the network. 
Preferably the down-link mode inidcation is used to control the channel decoding 
bit rate of a channel decoder in the mobile station. Preferably the down-link mode 
indication is used to control the speech decoding bit rate of a speech decoder in 
the mobile station. 

10 

According to a second aspect of the invention there is provided a wireless 
telecommunications system comprising a mobile station having a multi-rate 
speech encoder and a network comprising a multi-rate decoder, a signal analyser 
which measures speech characteristics of a decoded signal produced by the multi- 
15 rate decoder to produce speech characteristics information and an up-link mode 
control unit which produces a mode command, the mode command being used to 
control the speech encoding bit rate of the multi-rate speech encoder in the mobile 
station. 

20 Preferably the multi-rate decoder is a multi-rate speech decoder. Alternatively it is 
a multi-rate channel decoder. 

Preferably the system comprises a plurality of mobile stations. 

25 According to a third aspect of the invention there is provided a network for 
controlling speech coding of a multi-rate speech encoder in a mobile station, the 
network comprising a multi-rate decoder, a signal analyser which measures 
speech characteristics of a decoded signal produced by the multi-rate decoder to 
produce speech characteristics information and an up-link mode control unit which 

30 produces a mode command, the mode command being used to control the speech 
encoding bit rate of a multi-rate speech encoder. 
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Preferably the multi-rate decoder is a mujti-rate speecli decoder. Alternatively it is 
a multi-rate channel decoder. 

Preferably, the signal analyser measureis speech characteristics of the decoded 
5 signal on the decoded signal being output by the multi-rate decoder. 

An advantage of the invention is that it can be used with mobile stations which do 
not, themselves, control source-controlled adaptation. Therefore, it is possible to 
introduce this feature in the network of a pre-existing wireless telecommunications 
10 system with only minor modifications. In this case, no changes to the mobile 
stations are necessary. 

According to a fourth aspect of the invention there is provided a method of 
controlling adaptivie multi-rate speech coding in a wireless telecommunications 
15 system comprising a mobile station and a network, the method comprising the 
steps of: 

measuring channel quality of a link from the network to the mobile station; 
measuring speech characteristics of a signal which is to be transmitted from the 
network to the mobile station; 
20 forming a mode command on the basis of the channel quality and the speech 
characteristics; 

using the mode command to control the speech encoding bit rate of a multi-rate 
speech encoder located in the network. 

25 According to a fifth aspect of the invention there is provided a method of 
controlling adaptive multi-rate speech coding in a wireless telecommunications 
system comprising a mobile station and a network, the method comprising the 
steps of : . 

measuring channel quality of a link from the network to the mobile station; 
30 measuring speech characteristics of a signal which is to be transmitted from the 
network to the mobile station; 

using the speech characteristics to up-date thresholds which are used to generate 
a mode request; 
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forming a mode command on the basis of tlie channel quality and the thresholds; 
using the mode command, to control the speech encoding bit rate of a multi-rate 
speech encoder located in the network. 

5 According to a sixth aspect of the invention there is provided a method of 
controlling adaptive multi-rate speech coding in a mobile station communicating 
with a network, the method comprising the steps of: 
measuring channel quality of a link from the network to the mobile station; 
receiving up-dated thresholds which are used to generate a mode request the up- 
10 dated thresholds having been up-dated in accordance with speech characteristics 
of a signal which is to be transmitted from the network to the mobile station; 
forming a mode request on the basis of the channel quality and the thresholds; 
sending the mode request to the network for the network to control the speech 
encoding bit rate of a multi-rate speech encoder located in the network. 

15 

An advantage of this aspect is that it is not necessary to modify a mode command. 
The mode command may be formed as the result of up-dated thresholds. The 
invention allows the use of accurate unquantised channel quality measurements 
available in the mobile station. If a mode request in the network is over-ridden, 
20 since there are generally only a few levels of channel quality available when using 
the mode request (which has been sent by the mobile station) the accuracy of the 
mode command Is compromised. However, the original unquantised channel 
quality in the mobile station can have many levels (64 to 256 or even more) and so 
a more accurate mode command is provided. 

25 

Other aspects of the Invention relate to networks and wireless telecommunications 
systems having functional units to carry out the method steps of the fourth and fifth 
aspects of the Invention. 

30 Embodiments of the invention will now be described by way of example only with 
reference to the accompanying drawings in which: 
Figure 1 shows a GSM system having the AMR speech service; 
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Figure 2 shows an implementation of the invention in an up-link path of a GSM 
system; 

Figure 3 shows a first embodiment of an implementation of the invention in a 
" down-link path of a GSM system; arid 
5 Figure 4 shows a second, alternative, embodiment of an implementation of the 
invention in a down-link path of a GSM system. 

In the following, the invention is described in relation to Figures which separately 
show blocks used in up-link and down-link. This is in order to present the invention 
10 clearly. 

Figure 2 shows a wireless telecommunications system capable of providing an 
AMR speech service. The system comprises a network and a mobile station (MS). 
Certain features of the network, such as a transcoder (TC) and a base transceiver 
15 station (BTS), are not shown in this Figure. However, it is to be understood that 
they are present in a system according to the invention. Those parts of the system 
involved in up-link AMR control will now be described. 

In an up-link direction, in the MS an input signal is speech encoded in a multi-rate 
20 speech encoder, channel encoded in a multi-rate channel encoder and then 
transmitted across the air interface. In the network the channel encoded signal is 
decoded in a multi-rate channel decoder and then speech decoded in a multi-rate 
speech decoder to provide a decoded speech signal. 

25 In the network, channel quality infbnnation is derived by estimating the current 
state of the up-link channel. The quality information is processed in an up-link 
quality measurement unit to produce a quality Indicator. The quality Indicator from 
the up-link quality measurement unit is fed into an up-link mode control unit 
located in the network. 

30 

At the output of the multi-rate channel decoder, speech parameters are taken and 
fed into a source adaptation block where speech signal analysis is carried out to 
determine speech characteristics. In addition, or alternatively, at the output of the 
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multi-rate speech decoder, a current frame of decoded speech signal (synthesised 
speech signal) is taken and is also fed into the source adaptation block where 
speech signal analysis is carried out to determine speech characteristics. For 
dealing with an input from each source (both the channel decoder and the speech 
5 decoder), determining the speech characteristics can include classifying the signal 
as voiced or unvoiced and/or estimating the background noise level of the signal. 
In the decoded speech signal, a voice activity detector (VAD) can be used for 
determining the level of the background noise during speech pauses. For the 
channel decoded speech parameters an LTP (long term prediction) lag parameter 

10 and LTP gain values can be used to classify the signal as voiced or unvoiced. For 
example, if the LTP lag parameter is relatively stable and if the LTP gain value is 
close to unity, it usually indicates a highly voiced signal and if the LTP lag 
parameter is changing rapidly and LTP gain value is small it usually indicates an 
unvoiced signal. In one embodiment, these measurements are combined and LTP 

15 lag parameters are only calculated during speech (when the VAD indicates that 
this is so) so that speech pauses are ignored. Also the spectral parameters in the 
form of LPC (linear predictive coding coefficients) can be used to estimate whether 
the signal is voiced or unvoiced. 

20 If only one of the speech parameters and the decoded speech Is used to obtain 
the speech characteristics, It is preferred that it be the decoded speech. 

The speech characteristics are sent by the source adaption block to the up-link 
mode control unit. 

25. 

Speech signal characterisation of the decoded speech occurs in the source 
adaptation block continuously. 

The up-link rhode control unit receives a third input comprising network constraints 
30 such as load In the network. Alternatively, if the mode cannot be changed because 
of some network constraint then transmission power in the up-link direction is 
increased rather than entering into a more channel-error robust mode. The up-link 
mode control unit takes the results of the speech signal analysis together with the 
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channel quality from the up-link quality estimator and then uses thresholds to 
make an optimised decision about which mode to use. This decision takes into 
consideration possible constraints from network control, such as network load. The 
up-link mode control unit produces a Mode Command MCu Indicating the codec 

5 mode to be used in the up-link path. The Mode Command Is then transmitted In- 
band across the air interface to the MS and it is provided to the multi-rate speech 
encoder. In other embodiments, the mode indicator may be transmitted in another 
channel, for example in a signalling channel. The bit rate of speech encoding in 
the MS is changed in response to this command. The multi-rate speech encoder 

10 provides a Mode Indication Mlu to the multi-rate channel encoder of the MS. The 
bit rate of channel encoding is changed in response to this indication. The bit rate 
changes to the multi-rate speech encoder and to the multi-rate channel encoder 
are linked so that the up-link channel gross bit rate is maintained. The mode 
indicator is then transmitted in-band across the air interface to the network and it is 

15 provided to the multi-rate channel decoder in the network. The bit rate of the 
channel decoding is changed in response to this indication. The multi-rate channel 
decoder provides the Mode Indication to the multi-rate speech decoder. The bit 
rate of the speech decoder is changed in response to this indication. The bit rate 
changes to the multi-rate channel decoder and to the multi-rate speech decoder 

20 match the operations performed in the encoder to enable correct decoding of the 
encoded signal. Following these changes, the speech signal to be transmitted by 
the MS is encoded in a suitable codec mode taking into consideration the speech 
signal and the channel quality. 

25 An example will now be given of an optimised decision which takes account of the 
channel quality, load of the network and the analysis of the actual speech signal in 
the up-link in the network. In this example, there is a full rate channel and three 
modes 1 , 2 and 3. Mode 1 is the most robust mode with a large amount of channel 
coding and mode 3 is the least robust mode with a small amount of channeL 

30 coding but the highest absolute quality in error free situations. Initially, in up-link 
mode 3 is used and the channel quality Is good. In this example it is assumed that 
there is a very high background noise level in the received signal, which makes it 
more difficult to encode the signal with the very low-bit-rate speech codec (such as 



wo 02/03601 



PCT/FIOl/00560 



mode 1 in this example) and it is also assumed tliat suddenly the channel quality 
degrades significantly. If the load of the network is low, the optimised decision in 
this situation will be made as follows: 1) The load will not affect the decision in this 
case, 2) When looking only at the channel quality threshold values of the different 
5 modes it would be the most optimal to change from mode 3 to the lowest mode 1 . 
However, because of the very high background noise level, the system will instead 
send a mode command to the mobile station commanding it to use mode 2 in 
order to preserve the quality during the high background noise at the expense of 
the error rate. The mode control unit is intelligent in that it is able to combine 
10 information from various sources in generating the mode command. 

In a system having centralised control such as the GSM AMR speech service, the 
MS obeys the Mode Command and, as far as operation of the MS is concemed, 
the fact that the speech signal characteristics have been used when selecting the 
15 most appropriate mode is irrelevant. Therefore, in a system according to the 
invention, no changes are needed in the MS for the invention to be implemented. 

Figure 3 shows a first embodiment of an implementation of the invention for 
providing down-link AMR control. It can be used in conjunction with the up-link 
20 AMR control arrangement described in relation to Figure 2. 

In a down-link direction, in the network an input signal is speech encoded in a 
multi-rate speech encoder, channel encoded in a multi-rate channel encoder and 
then transmitted across the air interface. In the MS the charinel encoded signal is 
25 decoded in a multi-rate channel decoder and then speech decoded in a multi-rate 
speech decoder to provide a decoded speech signal. 

In the MS, channel quality information is derived by estimating tiie current state of 
the down-link channel. The quality information is processed in a down-link quality 
30 estimation unit to produce a quality Indicator. The quality indicator is used to 
generate a Mode Request MRd in a down-link Mode Request generator in a link 
adaptation block in the MS. The down-link Mode Request generator compares the 
quality indicator with certain thresholds to generate the Mode Request. The Mode 



wo 02/03601 



PCT/FIOl/00560 



16 

Request Indicates the preferred codec mode for the down-link. The Mode Request 
. is transmitted in-band across the air interface to the networl< where it is fed into a . 
down-linic mode control unit. . 

5 In the network, at the Input to the multi-rate speech encoder, a frame of the input 
signal is taken and it is fed into a source adaptation block where speech signal 
analysis is carried out to determine the characteristics of the input signal. This 
input signal may come from a fixed telephone network, from another cellular 
network or from another MSG of the same network. It is fed into the down-link 
10 mode control unit. 

The down-link mode control unit receives a third input comprising network 
constraints. The down-link mode control unit takes the results of the speech signal 
analysis together with the channel quality from the down-link quality measurement 
15 unit and uses these inputs to make an optimised decision about which mode to 
use. This may involve changing the transmitted Mode Request so that it also uses 
the estimate of signal quality provided by the source adaptation block. The down- 
link mode control unit generates a Mode Command MCd indicating the codec 
mode to be used in the down-link path. 

20 

The channel quality measurement used is the same as those used in the down- 
link direction of a prior art system, such as the GSM AMR speech service using 
centralised control. 

25 The down-link mode control unit usually grants the Mode Request. However, the 
Mode Request may be overridden depending on possible constraints from network 
control. The requested codec mode is then applied for encoding of the incoming 
speech signal in a way similar to that described above for the up-link channel. 

30 Figure 4 shows a second embodiment of an implementation of the invention for 
providing down-link AMR control. It can be used in conjunction with the up-link 
AMR control arrangement described in relation to Figure 2. 
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The embodiment of Figure 4 differs from that of Rgure 3 in the way in which 
source control is applied by the networic to the IVIS. In the embodiment of Figure 4, 
the output of speech signal analysis is not fed to the down-link mode control unit 
but instead it is used in a threshold selection and update decision module, where it 
5 updates the channel quality thresholds used by the MS. The channel quality 
thresholds, if modified, are sent to the MS using system signalling. Initial values 
are sent at the beginning of call establishment. The thresholds are then modified 
later, for example periodically and during hand-over. In an embodiment where the 
invention is applied to a GSM AMR system, this is layer 3 signalling. 

10 

In the MS, the threshold up-date is applied directly to the down-link mode request 
generator and is used, together with the channel quality indicator, to generate the 
Mode Request. This involves applying source-controlled adaptation to unquantised 
channel quality estimates to control Mode Request generation. One example of 
15 this adjustment is that when the speech signal contains background noise, the 
decision to switch from a higher bit rate codec to a lower bit rate codec is made 
using a lower channel quality-threshold than in the case when there is less 
background noise in the signal. 

20 In the embodiment of Figure 4, operation of the down-link mode control unit is 
compatible with operation of the GSM AMR specifications since the thresholds are 
sent to the mobile station using conventional signalling, for example layer 3 
signalling. However, the thresholds used in the down-link Mode Request generator 
to generate the Mode Request (quantised channel quality) are changed to 

25 compensate for changes in the speech signal. 

The embodiment of Figure 3 has a faster adaptation to changes in source quality 
than that of Figure 4. In the former case, the Mode Command is manipulated 
directly and the modified Mode Command can be transmitted rapidly to the MS. In 
30 the latter case, relatively slow layer 3 signalling is used to change the thresholds 
relating to the ' generation of a mode request in the MS. However, the 
implementation of Figure 4 is more accurate because the unquantised channel 
measurements in the MS can be used and checked against the thresholds sent by 
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layer 3 signalling. In the implementation of Figure 3 quantised channel quality 
information (IVIode Request from the MS) is used in the network. 

According to the invention, it is possible to provide source-controlled adaptation in 
5 a system which utilises channel quality-controlled adaptation and to centralise 
control for both functions in the network. In particular, the invention may readily be 
applied to the GSM AMR speech sen/ice since no signalling changes are required 
within the network. 

10 Although the invention is described with reference to a GSM AMR system, it may 
apply to any wireless telecommunications system in which channel quality- 
controlled adaptation is centralised in the network. Furthermore, it should be noted 
that the invention does not necessarily have to be applied to a system having 
channel quality-controlled adaptation. It can be used in any network where it is 

1 5 desired to implement centrally controlled source quality adaptation. 

In the preceding description, the embodiments show network communicating with 
a single mobile station. Of course, the invention would be applied to a wireless 
telecommunications system in which a plurality of mobile stations communicate 
20 with a network. 

Particular implementations and embodiments of the invention have been 
described. It is clear to a person skilled in the art that the invention is not restricted 
to details of the embodiments presented above, but that It can be implemented in 
25 other embodiments using equivalent means without deviating from the 
characteristics of the invention. The scope of the invention is only restricted by the 
attached patent claims. 
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Claims 

1. A method of controlling adaptive multi-rate speech coding in a wireless 
telecommunications system comprising a mobile station and a networic, the 
method comprising the steps of : ^ . 

5 measuring speech characteristics of a decoded signal produced by a multi-rate 
decoder located in the network; 

forming a mode command on the basis of the speech characteristics; 
transmitting the mode command from the network to the mobile station; and 
using the mode command to control the speech encoding bit rate of a multi-rate 
1 0 speech encoder located in the mobile station. 

2. A method according to claim 1 which includes the step of producing a channel 
quality measurement. 

15 3. A method according to claim 2 which includes the step of combining the channel 
quality measurement and the speech characteristics to produce the mode 
command. 

4. A method according to any preceding claim in which the signal is a speech 
20 decoded signal. 

5. A method according to any preceding claim in which the signal comprises 
speech parameters from a channel decoded signal. 

25 6. A method according to any preceding claim in which a mode indication derived 
from the mode command is used to control the decoding bit rate of a multi-rate 
decoder in the network. 

7. A method according to any preceding claim in which a measurement is made of 
30 speech characteristics of ah input signal to a multi-rate speech encoder located in 
the network. 
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8. A method according to claim 7 in which a mode request is modified by the 
measurement of speech characteristics of the input signal to produce a down-link 
mode command. 

5 9. A wireless telecommunications system comprising a mobile station having a 
multi-rate speech encoder and a networic comprising a multi-rate decoder, a signal 
analyser which measures speech characteristics of a decoded signal produced by 
the multi-rate decoder to produce speech characteristics information and an up- 
link mode control unit which produces a mode command, the mode command 
10 being used to control the speech encoding bit rate of the multi-rate speech 
encoder in the mobile station. 

10. A network for controlling speech coding of a multi-rate speech encoder in a 
mobile station, the network comprising a multi-rate decoder, a signal analyser 
15 which measures speech characteristics of a decoded signal produced by the multi- 
rate decoder to produce speech characteristics information and an up-link mode 
control unit which produces a mode command, the mode command being used to 
control the speech encoding bit rate of a multi-rate speech encoder. 

20 11. A method of controlling adaptive multi-rate speech coding in a wireless 
telecommunications system comprising a mobile station and a network, the 
method comprising the steps of: 

measuring channel quality of a link from the networic to the mobile station; 
measuring speech characteristics of a signal which is to be transmitted from the 
25 networic to the mobile station; 

forming a mode command on the basis of the channel quality and the speech 
characteristics; 

using, the mode command to control the speech encoding bit rate of a multi-rate 
speech encoder located in the networic. 

30 

12. A method of controlling adaptive multi-rate speech coding in a wireless 
telecommunications system comprising a mobile station and a network, the 
method comprising the steps of: 
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measuring channel quality of a link from the network to the mobile station; 
measuring speech characteristics of a signal which is to be transmitted from the 
network to the mobile station; 

using the speech characteristics to up-date thresholds which are used to generate 
a mode request; 

forming a mode command on the basis of the channel quality and the thresholds; 
using the mode command to control the speech encoding bit rate of a multi-rate 
speech encoder located in the network. 
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